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Abstract. Countable state Markov shifts are a natural generalization of the 
well-known subshifts of finite type. They are the subject of current research 
both for their own sake and as models for smooth dynamical systems. In this 
paper, we investigate their almost isomorphism and entropy conjugacy and ob- 
tain a complete classification for the especially important class of strongly posi- 
tive recurrent Markov shifts. This gives a complete classification up to entropy- 
conjugacy of the natural extensions of smooth entropy-expanding maps, e.g., 
C°° smooth interval maps with non-zero topological entropy. 
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1. Introduction 

In this paper, "Markov shift" means a countable (possibly finite) state irreducible 
Markov shift. We use the same symbol to denote the domain of a Markov shift and 
the shift map on this domain. There are several characterizations (|2.3I) of the class 
of strongly positive recurrent (SPR) Markov shifts; this is the class of Markov shifts 
which most resemble finite state Markov shifts. 

A map if : S — > T between Markov shifts is a one-block code if there is a function 
<E> from the symbol set of S into the symbol set of T such that ((px) n — for 
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all x and n. A T-word W (of length \W\) is a magic word for such a map ip if the 
following hold. 

(1) If y G T and {n G Z : y[n,n + \ W\ — 1] = VK} is unbounded above and 
unbounded below, then y has a preimage under tp. 

(2) There is an integer I such that whenever C is a T-word and two points x 
and x' of S satisfy (c^x) [0, 2 1 W | + 1 C\ - 1] = WCW = (ipx')[0, 2 1 W | + 1 C| - 1] , 
then x[J,/ + |W| + |C| - 1] = x'[I, 1 + \W\ + \C\ - 1]. 

(In the constructions of this paper, the integer 7 of the last condition will be zero.) 
It follows from (J2J) that the preimage in Q is unique. 

We define two Markov shifts S and T to be almost isomorphic if there exist 
another Markov shift R and injective one-block codes R — > 5 , R — > T each of which 
has a magic word (recall that all Markov shifts are understood to be irreducible). 
An almost isomorphism will induce a shift-commuting, Borel bimeasurable bijection 
between the images of the one-block codes, and thereby will induce isomorphisms 
of measurable systems (S, fx) — > (T, v) for shift-invariant Borel probabilities fi, v 
which assign these images measure 1. The collection of such measures will include 
the ergodic measures which give positive measure to every nonempty open set, and 
in the SPR case will include all ergodic measures with entropy sufficiently close 
to the topological entropy (defined l|2.1(l as the supremum of the measure-theoretic 
entropies with respect to invariant Borel probabilities). These isomorphisms will be 
finitary (homeomorphisms between measure-one sets) , with exponentially fast cod- 
ing time for exponentially recurrent ergodic measures (such as the unique measure 
of maximal entropy, in the SPR case). 

Our main result (|6.4[) is that SPR Markov shifts are almost isomorphic if and 
only if they have the same entropy and period. This can be viewed as an analogue 
of the Adler-Marcus classification of irreducible shifts of finite type up to almost 
topological conjugacy by entropy and period (|8.2|l . The classification beyond SPR 
cannot possibly have the same simplicity l|3.6|l . We also give a sufficient condition 
for almost isomorphism of not necessarily SPR Markov shifts (|fi.2|l . 

Two systems are entropy conjugate if they are Borel conjugate after restriction to 
sets which have full measure for all ergodic measures with entropy near the topolog- 
ical entropy (see Definition I3.3|) . Various smooth, piecewise smooth and symbolic 
systems are known (Theorem 17.111 to have natural extensions which are entropy 
conjugate to a finite union of SPR Markov shifts. Consequently, our main result 
provides simple invariants which classify the natural extensions of these systems up 
to entropy conjugacy fTheorem l7.2|l . 

One motivation of our paper is recent interest in the thermodynamic formal- 
ism for countable state Markov shifts [TOI [13 E21 E31 EH EH SU EHI E01 E3 In 
a companion paper [2|, we describe a reasonable class of Borel potentials which 
behaves well under almost isomorphism. We are also motivated by the use of 
countable state Markov shifts to code some partially or piecewise hyperbolic sys- 
tems (0ElElESlESlEZIEniEniEniE7|EniEland successors of El); investigations 
of coding relations among Markov shifts (see ^1 El H41 1151 Ho] and their refer- 
ences); and a longstanding finite state coding problem for Markov measures not of 
maximal entropy l|8.4|l . 

This paper is dedicated to Klaus Schmidt and Peter Walters, on the occasion of 
their sixtieth birthdays. Words from the poet E.E.Cummings JT] suit them: 
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septembering arms of year extend 
less humbly wealth to fool and friend 

2. Definitions and background 

Let A be a square matrix with nonnegative integer entries, with rows and columns 
indexed by a finite or countable set X. View A as the adjacency matrix of a 
directed graph with some set of edges Sa- Give Ea the discrete topology; give 
(£a) Z the product topology; let Sa denote the set of doubly infinite sequences on 
Ea corresponding to walks through the graph; and let Sa have the topology it 
inherits as a subspace of {£a) Z - The shift map homeomorphism on Sa is defined 
by sending a bisequence (x n ) to the bisequence (y n ) such that y n — x n +\ for all n. 
Given a point/bisequence x, we let x[i, j) denote the word XiXi+i . . . Xj. If W is a 
path of n edges (or a finite sequence of n symbols), then \W\ denotes n. We will 
use the same symbol (e.g. Sa) for the shift map and its domain. Sa is uniquely 
determined by A, up to the naming of the edges in the graph; in this paper, we 
can safely ignore this ambiguity, and refer to "the" Markov shift defined by A. We 
use the edge shift presentation rather than the vertex shift presentation, but this 
paper could have been written entirely with the vertex shift presentation. 

The matrix A is irreducible if for every row i and column j there exists n > 
such that A n {i 1 j) > 0. (The term "indecomposable" as used in [21] is synonymous 
with our "irreducible" .) Convention: in this paper, by a Markov shift we will mean 
a homeomorphism Sa defined by a (finite or countably infinite) irreducible matrix 
A over Z + . 

By a graph we will always mean a directed graph, with finitely or countably 
infinitely many vertices and edges. Given a vertex v in a graph, a first return loop 
to v is a path of edges which begins and ends at v and otherwise does not visit v. 
The number of edges in a path p is its length, \p\. A loop graph (in |24j . a petal 
graph) is a graph Q with a distinguished vertex v, such that Q is the union of the 
first return loops to i>, and every vertex except v lies on a unique first return loop. 
From here, for brevity by a loop in a loop graph we will always mean a first return 
loop to the distinguished vertex. 

Let / be a power series Y^n=i fnZ n with nonnegative integer coefficients, i.e. 
/ 6 zZ + [[z}]. We will sometimes use the notation rad(/) to denote the radius of 
convergence of /. A loop graph Qf for / is a loop graph which for each n contains 
exactly /„ loops of length n. The loop graph is uniquely determined by / up to the 
naming of the edges and vertices. Given Qf, the loop shift o~f is the Markov shift 
Sa such that A is the adjacency matrix of a loop graph for /. 

Suppose Sa is a Markov shift, with associated graph Q(A), and v is a vertex in 
Q(A). Let f n denote the number of first return loops of length n to v, and assume 
f n < oo for all n (a property which is independent of the choice of vertex). Let 
o~ f be the corresponding loop shift. The period of Sa (or of 07) is the g.c.d of 
{n : f n > 0}, and does not depend on the choice of vertex. Sa is mixing if and only 
if the period is 1. 

Definition 2.1. By the entropy h(SA)(— h(o~f))j we will mean the Gurevich entropy 
[2"T] , which is the supremum of the measure theoretic entropies over invariant Borel 
probabilities. This entropy equals log(lim Irnl 1 /"), where r n is the number of loops 
(not necessarily of first return) at an arbitrary vertex, and the limsup here is a limit 
when Sa is mixing. 
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Let log(A) denote the entropy of Sa and suppose < log(A) < oo. Four basic 
classes can be described in terms of the sequence (/„). 

(1) S A is transient if £ /„/A" < 1 

(2) S A is recurrent if £ /n/A™ = 1 

(3) Sa is positive recurrent if ^2f n /X n = 1 and ^nf n /\ n < oo 

(4) Sa is strongly positive recurrent (SPR) if Urn l/nl 1 /™ < A. 

(We are using names of matrix classes to describe the Markov shifts they define.) 
Here (4) =4> (3) ==> (2). The properties (l)-(4) don't depend on the particular 
choice of vertex [21]. The classes (l)-(3) correspond to the classical Vere- Jones 
classification jSUll^ of N x N nonnegative matrices. A finite entropy Markov shift 
is positive recurrent if and only if it has a measure of maximal entropy \x (i.e., 
h(S) = h(S,fx)), in which case there is only one measure of maximal entropy |21| . 

Markov shifts defined over a countable alphabet may differ significantly from 
those which have a finite alphabet (subshifts of finite type), see e.g. [121 1241 l3*T1 
[SU][S2]. The search for well-behaved Markov shifts leads to the class of SPR shifts. 
The notion of SPR already appeared in Vere- Jones' work in the special case 
of stochastic matrices as a necessary and sufficient condition for some exponential 
convergence. Following work of Salama (see |45U46ll4*7) ) on conditions for the strict 
decrease of entropy in proper subsystems, the class of SPR shifts was introduced 
independently by Ulf Fiebig JU| and by Gurevich [221 ^ wn0 introduced the class 
of "stably recurrent" matrices, which contain those defining what we call the SPR 
Markov shifts. In [23, this class is developed further as the fundamental class 
of "stable positive" matrices [21] • We view "SPR" as also abbreviating "stable 
positive recurrent" . 

We will record some of the conditions on a Markov shift which are equivalent to 
SPR. For one, we first recall the definition of exponential recurrence while remarking 
some equivalences. 

Remark 2.2. Suppose (S, f£) is an ergodic automorphism of a topological space with 
a Borel probability measure. Let V be a measurable set such that ixV > 0. Let ry 
be the return-time function on V, ry(x) = min{n > : S"x £ V}. By ergodicity 
the system (S,/j,) is isomorphic to a tower over V with return time function ry. ft 
is not difficult to check that the following are equivalent: 

(1) (S,/i) is exponentially recurrent: i.e. for every open set V with fj,V > 0, 

lim (fi{x eV:r v (x)> n}) 1/n < 1 . 

(2) (S, /i) is exponentially filling: i.e. for every open set V with /iV > 0, 

fim"(^{2; £S:^ U^ =1 S' k V}) 1/n < 1 . 

(3) (S 1-1 ,/!) is exponentially recurrent. 

Proposition 2.3. The following are equivalent conditions on a Markov shift S . 

(1) S is SPR. 

(2) Removing any edge from the graph defining S strictly lowers the entropy 
(H23, [13,122 Remark 3.16]; see 

(3) Some (equivalently every) local zeta function (counting the number of fixed 
points x £ [v] for some vertex v ) has a non-trivial meromorphic extension 
(see [21] and Section^. 
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(4) S has a measure of maximal entropy \x, and (S, fj,) is exponentially recurrent. 

Proof. We will prove l@J 0J. Note that both conditions imply that there 

exists a measure of maximum entropy and recall that such a measure gives positive 
measure to every non-empty open subset |21j . We may and do assume that we have 
such a measure fj,. 

Let Vj be the open set of points x such that the edge xq begins at a given vertex 
I. Let /„ be the number of first return loops to /. By the specific form for /z |31II21| . 
there is a constant cj > such that, for any first return loop Wq . . . W n —x, the set 
[W] := {x : xo-'-Zn-i = W} has /i-measure c/(l/A)™. Thus fi(S \ U% =1 S n Vi) 
goes to zero exponentially fast, i.e. Vj is exponentially filling in (S, /i), if and 
only if XmLjv+i Cl ( n ~ N)(l/\) n goes to zero exponentially fast. But the latter is 
equivalent to the SPR property. 

To conclude, it is enough to see that SPR implies not only exponential filling for 
the sets Vi for any vertex /, but for all non-empty open sets V. Any such V contains 
a cylinder set C = {x : x[—k, k] = W} for some word W, and W becomes a vertex 
in a higher block presentation S' of S. Because S' is again SPR 0] Propostion 
2.12], C is exponentially filling in S' , and therefore so is V in S. □ 

For more background, see and their references for Markov shifts and 

nonnegative matrices; [HSj for entropy and ergodic theory; and [2103 for symbolic 
dynamics. 

3. Magic words, almost isomorphism and entropy-con jugacy 

We assume the definitions already given in the Introduction. Given a subset K 
of a Markov shift S, we let M.(K) denote the set of shift-invariant Borel probabil- 
ities fi on S such that /J.K = 1. Let K' be a subset of another Markov shift S'. 
Given a shift-commuting Borel automorphism 7 : K — > K', we may use the same 
symbol 7 for related maps such as 7 : M(K) — > Ai(K'). Recall a measure on a 
space has full support if it is nonzero on every nonempty open set. If /1 is a shift 
invariant measure on S, then it defines a measure-preserving system (S, fi). An iso- 
morphism of measure preserving shift systems (S,fi) — » (S',fi') is a bimeasurable, 
shift-commuting, measure-preserving bijection from a subset K of S to a subset K' 
of S", where fiK = fi'K' — 1. Such an isomorphism is finitary if K and K 1 can be 
chosen so that the map K — > K' is a homeomorphism, with respect to the topolo- 
gies K and K' inherit as subsets of the topological spaces S and S'. In this case, 
for ^-almost all x, there exists a minimal nonnegative integer n = n(x) such that 
for /i-almost all points x' with x'[— n, n] = x[— n, n], we have (yx)o = (yx')o- (In 
other words, /i-almost surely ip is a variable length block code.) We then define the 
expected coding time of <p to be J s n(x)dp>. We say ip has exponentially fast coding 
if limfe \/i{x : n(x) > kW 1 ^ < 1; this property implies the expected coding time is 
finite. We say systems (S,/jl) and (S',fx') are finitarily isomorphic if there exists a 
finitary isomorphism between them. We say following [S3] that a measurable 
isomorphism is hyperbolic structure preserving if on sets of full measure it respects 
the stable and unstable relations (here, the relations of being forwardly/backwardly 
asymptotic under the shift). 

For S, S' Markov shifts with given invariant measures /i h //, a magic word 
isomorphism is a finitary isomorphism 7 : (S, (a) — * (S',fjf) such that both 7 and 
7 _1 have a magic word. The definition of magic word here is the same as in the 
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introduction, with the modifications that conditions (1) and (2) are required to 
hold only on sets of full measure, and the set of points seeing the magic word W in 
infinitely many positive and negative coordinates is required to have full measure. 
The latter condition is automatic for an ergodic measure assigning positive measure 
to the set {x : x[0, \W\ — 1] = W}. (We remark that the definition of magic word 
isomorphism in |20) is slightly different; a magic word here for 7 is a magic word in 
d3| for 7- 1 .) 

Proposition 3.1. Suppose 7 : (S,fi) — + (S',fj,') is a magic word isomorphism be- 
tween Markov shifts endowed with invariant and ergodic probability measures. Then 
7 is a hyperbolic structure preserving isomorphism. If the systems are exponentially 
recurrent, then 7 codes exponentially fast. 

Proof. With probability one, points will see a magic word infinitely often in positive 
and in negative coordinates. Therefore, the left tail x{— 00, 0] of a bisequence x will 
code some left tail y(— 00, to] of its image, and likewise for right tails. This proves 
the a.e. preservation of hyperbolic structure. 

Now suppose the systems are exponentially recurrent. Let V be an S'-word which 
is a magic word for 7" 1 . Without loss of generality, let the index / used in the 
definition of a magic word in the Introduction be zero. Let J be the length of V. 
For n > J, let E n be the set of S-points x such that V is not a subword of x[—n, —1] , 
and let F n be the set of S'-points x such that V is not a subword of x[l,n]. Then 
x[—n,n] codes (<px)o except for a subset of E n U F n . Let V also denote the set 
{x : x[0, J -I] = V}. We have 

F n = {x : x i U n k Z(S- k (V)} , E n = {x:x£\J n k z(S\S J V)} 

and it follows from Remark 12.21 that 

151 U{E n U F n )) 1/n < En" UE n + ^F n ) V " < 1 . 

□ 

Definitions 3.2. By a measurable system we will mean a measurable map T : X — » 
X. In this paper, the underlying er-algebra will be the Borel er-algebra naturally 
associated to X , and the entropy h(T) is defined as the supremum of the measure 
theoretic entropies with respect to T-invariant Borel probabilities. For a measurable 
system T : X — > X, a subset N C X is called entropy-negligible if there is 
h < h(T) such that fJ-(N) = for every ergodic invariant probability measure /i 
with h(T, fi) > h. 

Definition 3.3. Two measurable systems T : X — > X and S : Y — > Y are entropy- 
conjugate if there exist entropy-negligible subsets Xq C X and Yq C Y and a 
bimeasurable bijection 7 : X \ Xq — ► Y \ Yo such that Sj — 7T for all x £ X \ Xq. 
Such a map 7 is called an entropy- conjugacy. 

Proposition 3.4. Suppose S and T are almost isomorphic Markov shifts. Then 
h(S) = h(T), and there are Borel subsets K and K' of S and T, and a shift- 
commuting Borel- measurable bijection 7 : K — > K' , such that the following hold. 

(1) K and K' are residual subsets of S and T (contain dense Gs sets). 

(2) The map 7 induces a bijection A4(K) — > A4(K') (fi 1— > // , say) such that for 
each such pair //, \j! the map 7 induces an isomorphism 7 : (S, /i) — ► (T, jj,'), 
which is a magic word isomorphism when /1 and \j! have full support. 
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(3) M(K) and Ai(K') contain all ergodic shift-invariant Borel probabilities on 
S and T with full support, and these correspond under 7. 

(4) If S is SPR, then so is T, and 7 is an entropy- conjugacy from S to T. 

Proof: Suppose S and T are almost isomorphic Markov shifts, i.e., suppose we 
have another Markov shift R and injective one-block codes ip : R — > S and ip ■ R — > 
T, with magic words W and W respectively. We assume for simplicity that 1 = 
in their definition. 

Let W*, resp. W*, be some i?-word projecting to W, resp. W. Pick some other 
i?-words C*,.D*,.E* such that u>* = W* C*Wl D* W*'i?*W* is also an i?-word. Let 
w be the S'-word below this i?-word. Each x € Fix(S n ) such that x[0, \w\ — 1] = w 
has a unique preimage <p^ 1 (x) G Fix(i?™) and it satisfies Lp^ 1 (x)[0, \w\ — |W* | — 1] = 
W*C*WID*WIE*. Therefore i«p- l {x)[\W t C*\, \W,C t WiD t Wi\ - 1] = VFCW' 
for some T-word C. ip(p^ 1 (x) € Fix(T"). Moreover the map ip is injective on 
Fix(i?™) fl [w*] as w* projects in T to a magic word. 

Thus the number of n-periodic sequences in S which starts with w is a lower 
bound for the number of n-periodic sequences in T which starts with W'CW ' . The 
growth rates of these numbers are the entropies. Thus h(S) < h(T). By symmetry, 
h(S) = h(T). 

Let K be the image of ip in S and let K 1 be the image of ip in T. Let 
7 : K — > if' be the bijection 7 = i/V -1 - For any n, the set K n (W) of points in 
S which see W in at least n negative coordinates and n positive coordinates is a 
dense open set, and K contains the intersection K(W) of the K n (W). Thus K is 
a residual subset in S, as is K' in T. 

Q With respect to the measure 1 sets K(W)r\j- 1 K(W) and ~/K(W) nK(W), 
the map 7 is a magic word isomorphism between corresponding measures with full 
support. 

(jsU Let Cw = {x : x[0, \W\ — 1] = W}. Let /x be ergodic with full support; then 
fj,Cw > 0, and by ergodicity fx{K{W)) = 1, so fJ,(K) = 1 and /U G M(K ). Suppose 
[/ is an open set in S; then UT\K is open in if and = fj,(KnU) = /i' (7 (K fit/)). 
Modulo null sets, non-empty open sets of if' contain such non-empty sets ^{K C\U). 
It follows that /i' = 7/1 is a fully supported measure when /1 is. The converse is 
shown similarly. 

J3J| 7 takes the measure of maximal entropy /i of S (which has full support 
|21j ) to a measure /1' of T which is isomorphic and therefore of maximal entropy 
(recall h(S) = h(T)). Because S is SPR, by Proposition 12. 'S\ (S. u) is exponentially 
recurrent. By |44|. exponential recurrence is invariant under finitary isomorphism, 
so fi' is an exponentially recurrent measure of maximal entropy for T. We conclude 
from Proposition O that T is SPR. 

By Proposition ^. 31 the entropies of measures such that (i(C\y) = are bounded 
away from h(S), hence M(K) contains all ergodic and invariant measures with 
entropy close to h(S). The same holds for M(K'). By (J2J, 7 is an entropy- 
conjugacy. □ 

Remark 3.5. Suppose S, S' are SPR Markov shifts; /x,/i' are ergodic measures with 
full support; and 7 : (S 1 , /i) — > (S 1 , /i') is a magic word isomorphism. The previous 
proof shows 7 must be the restriction of an entropy-conjugacy between S and S' . 
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Remark 3.6. For positive recurrent shifts which are not SPR, the work of Ulf Fiebig 
|18j shows that recurrent rate invariants of finitary isomorphism are much finer and 
richer. These also give invariants of almost isomorphism. 

Proposition 3.7. The following are true. 

(1) Suppose R — > S and S — > T are injective one-block codes between Markov 
shifts, each of which has a magic word. Then their composition is an in- 
jective one-block code with a magic word. 

(2) Suppose o~f is the loop system built from first-return loops to a vertex for a 
Markov shift Sa- Then the natural inclusion o~j^> Sa is a one-block code 
with a magic word. 

(3) If R — > S, R — ► T is an almost isomorphism between S and T, and 07 
is a first-return-to- a-vertex loop system for R, then there is also an almost 
isomorphism o~f — > S, o~ f — > T. 

(4) Almost isomorphism defines an equivalence relation on Markov shifts. 

Proof. We leave the proof of Proposition 13.71 as an exercise. For the last claim 
(which we do not use in our proofs), the nontrivial step is to show, given Markov 
shifts i?i,i?2,7 1 and injective one block codes with magic words <p : Ri — ► T, (p% : 
i?2 — » T, that there is a Markov shift S and injective one block codes p\ : S — > R\, 
P2 ■ S — > i?2- For this one can use a standard fiber product construction pTTl 13*5] . 
as follows. Let S' be the subset of R\ x R% which consists of points (x, y) such that 
ipix — <p2y, and define p\ : [x,y) i— » x and p2 : (x, y) i— > y. Let S be the unique 
maximal irreducible Markov shift contained in S' which has dense image in Ri and 
in i?2. It can be verified that S,p±,p2 meet the requirements. □ 

Within a class of almost isomorphic Markov shifts, one can freely localize to 
loop shifts and still capture all ergodic phenomena with full support. However, if 
a Markov shift S is not SPR, the localization may miss many periodic points and 
measures of large entropy, as the next example shows. 

Example 3.8. Suppose 07 is not SPR. Then 03 there is a loop I of 07 such that 
the subsystem T which misses £ has h(T) — h(S) — log A. Pick a vertex on I 
(other than the base vertex for a/) and let a g be the loop shift based at this vertex. 
We have the obvious injective block code o~ g — > 07 with a magic word. However, 
the supremum of the entropies of ergodic measures assigning measure one to the 
complement of the image is log A. Therefore, the inclusion o~ g — * 07 is not an 
entropy-conjugacy. 

4. Zeta FUNCTIONS 

Suppose T is a bijection of a set, T : X — » X, such that for all n > the 
cardinality of the fixed point set of T n (i.e., |Fix(T")|) is finite. Then the Artin- 
Mazur zeta function of T is defined as 

n 

Ct(z) :=exp£|Fix(T")|- . 

* — ' n 

n>l 

Let O n (T) denote the set of orbits of cardinality n of T. We have the well known 
product formula 

00 j 
(4.1) CtW = (11(1 - Z »)I°»(T)I)- . 

n=l 
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This formula is easily verified when T contains just one finite orbit, and then 
holds at the level of formal power series by taking countable products. (Note, 
the zeta function and product formula make perfectly good sense at the level of 
formal power series even when lim |Fix(T")| 1 / Tl = oo.) For a loop shift at, with 

f( z ) = Y,n=l /« z "' we have 

(4.2) c /W = _^_. 

We do not know the earliest reference for (|4.2() ; it appears in |57] and in the case 
f(z) is a polynomial it is a special case of a result proved in the "bigamy" paper 
P]. For a detailed proof of a generalization of (|4.2|) . see |24l Theorem 8.2]. We 
remark, 14.2fl in general follows from the case that f(z) is a polynomial, because 
coefficients of degree at most n in the series on either side of (|4.2|l match what would 
be computed for the loop shift for the truncated series fiz + f 2 z 2 + --- + f n z n . 

A Markov shift T may fail to have a well defined zeta function in that |Fix(T™)| 
may be infinite for some or all n. To avoid this problem at least when the entropy 
is finite, we use, as in a local zeta function, i.e., the zeta function of the 

loop shift given by the return loops to a given vertex s. For a finite entropy Markov 
shift, for every n there are only finitely many first return loops to s of length n, so 
the local zeta function must be well defined. 

Proposition 4.3. Let Uf be a mixing SPR loop shift with finite positive entropy 
log(A). Let C(z) denote its zeta function ^ af (z). Then £(z) is holomorphic in 
\z\ < 1/A and has a meromorphic extension to a larger disk (all our disks are 
centered at zero). More precisely, (1 — \z)£(z) is holomorphic and nonzero in some 
disk \z\ < r with r > 1/A. 

Proposition 14.31 is in the treatise of Gurevich and Savchenko j^J Proposition 
9.2]. We give a proof for completeness. 

Proof: By definition of SPR, / is holomorphic on a disk of radius strictly greater 
than A -1 . Its power series has no constant term, hence it is not constantly equal to 
1. It then follows from (|4.2|l that has a meromorphic extension to this disk with 
poles exactly at points z where f(z) = 1. Let also denote this meromorphic 
extension. 

Now /(A -1 ) = 1 as of is not transient. Hence A -1 is a pole of CO 2 )- As f(z) is a 
power series with non-negative coefficients not all zero, /'(A -1 ) > 0. In particular, 
/'(A -1 ) is non-zero and the pole at A -1 is, as claimed, a simple pole. 

The mixing assumption implies that there are no other poles on the circle \z\ = 
A -1 . Indeed, let \z\ = A -1 . We have 



n>l 



<^/„|z"| = l 

n>l 



and the inequality is an equality iff z n = \z n \ = A~™ for all n > 1 such that /„ ^ 0. 
Such a z must be tuX^ 1 with to a root of unity of some order q > 1 such that 
/„ => q\n. Therefore q divides the period of tr/. But <jf is mixing and 
therefore this period is 1. In particular, q = 1 and z = A -1 . By compactness, 
z = A -1 is the only pole of ((z) on some disk of radius > A -1 . □ 

The following corollary will be essential to the proof of our main result (|6.4(l . 
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Corollary 4.4. Let a / and a g be mixing SPR loop shifts of equal finite entropy 
log(A) > 0. Then 

l/n 

Urn \O n (a f )\ - \O n {a g )\ <A. 

Proof: Set a n = | Fix(( CT/ )")| - | Pix((a 9 )™)| and b n = \O n {o f )\ - \On{a g )\. With- 
out loss of generality, we assume b n ^ for infinitely many n, so lim |&.„| 1/Ti > f 
and limlanl 1 /" > 1. Because a n = X)fc|n ^fc, and by Mobius inversion nb n — 

J2k\n f i ( n /^) a k, it then follows that lim l^nl 1 /" = lim lanl 1 /™. 
By Proposition ^. 31 



exp^(|Fix((<7 / n|-|Fix(Kr)| 



z" 
n 



is a holomorphic function on a disk of radius r > 1/A, and consequently so is 
Y^{a n /n)z n . Therefore limjanl 1 /™ < A, which implies lim^l 1 /™ < A. 

□ 



5. The Loops Lemma 

Given a power series k with zero constant term, we use the standard notation k* 
to denote the power series 1 + k + k 2 + ■ ■ ■ = 1/(1 — k). Similarly, given a set K of 
words, containing only finitely many words of any fixed length, we let K* denote 
the set of words which are concatenations of words in K, and we include in K* the 
empty word, which we regard as an identity element for concatenation. 

The construction of the next lemma is lifted from (SHj . 

Lemma 5.1. Suppose f = h + k where h and k are formal power series in z1+[[z\\ 
which are not identically zero. Then 1 — hk* = (1 — f)/(l — k), and there is an 
infective one-block code ip from o~hk* to Of which has a magic word. If additionally 
h{(Jk) < h(af), then ip is also an entropy- conjugacy. 

Proof: The equality is an application of the geometric series. 

We have h = Y^n=i h n z n and k = Y^n=i k n z n ■ Let Gf be a loop graph for 
f = h + k, with C the set of (first return) loops of Q. Choose subsets H and K, of C 
such that C is the disjoint union of TL and /C; and for every n, the number of loops 
of length n in TL is h n ; and for every n, the number of loops of length n in /C is k n . 
Let £' be the loop set of a loop graph Qhk* for the series hk* = h(l + k + k 2 + ■ ■ ■ ). 
Choose a bijection [3 ; £ — * TLJC* respecting word length and let (3 define a labeling 
of and thereby a one-block code ip from a^- to 07. Each loop I of £' has a 
label / which has a nonempty prefix which is an element of TL, and no word of TL 
occurs in I except as a prefix. Consequently any word of TL is a magic word for ip. 

It follows in particular that ip is an isomorphism w.r.t. any ergodic and invariant 
measure, unless the measure avoids all words of TL and thus lives on a^- The 
entropy of such a measure is bounded by h(<7k), so cp is an entropy-conjugacy if 
h(a k ) < h{a f ). □ 

The rest of this section is devoted to the following lemma, which records the key 
features of our main construction. 

Lemma 5.2 (Loops Lemma). Let Of be a mixing loop shift, f — Y] 1 f n z n , with 
< h(o-f) = log(A) < 00. Let {rk)^ =1 be a nondecreasing sequence of positive 
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integers 1 . Let i?„ = #{fc : r k — n}. Set / <1> = /. Given f <k> , define the series 
j<k+i> = y^ =i f<k+i> z n by the equa Uon 



OO 



J<k + l> {z) ^ { j<k> {z) _ z r k) ^ z , 

n=0 



or equivalently 



i- /. i • , •, 1 f <k> ( z ) 



f <k+L> (z) 



Assume the pair (/, (^sOfcLi) satisfies the positivity condition 

(5.3) f< k> > l , ken 

(which guarantees every f <k> (z) £ zZ+[[z]]). Because lim^ — oo, it follows that 
for every n in N, the sequence {fn k> )kLi * s eventually constant. Define fn°° > = 
lim fe /< fc> and f <oc> = E,T=i °° > ^™- &e the series f <k> where k is 

defined by the condition rj < n if and only if j < k. 

Then there is a one-block code ip : cry <oc> — ► <jf with the following properties. 

(1) ip is injective. 

(2) For every n, 1 - = (1 - / (n) (z))/((l - z n ) R ^) . 

(3) 1 ~/<°°>(z) = (1 - f(z))/(I\Zi(l - ■ 

(4) 7/lim„(i?„) 1/n < A, then h(a f <oc>) = log(A). 

(5) If limn^n) 1 /™ < A, i/ien er^ is SPR if and only ifo~f«x> is SPR. 

(6) Suppose 

(5.4) i?„ < /„ for all n, and also R n < f n for n = r\ . 

Then the map ip can be chosen to have a magic word. 

(7) Suppose in addition to H5.4(l that Um n (i2 n ) 1 / n < A. Then the map ip can be 
chosen to have a magic word and also to be an entropy- conjugacy. 

Proof: Construction of Q <k> . Inductively, we will define a sequence of labeled 
graphs Q <k> , such that the underlying graph Q <k> is a loop graph for f <k> , the 
labeling set is the edge set of Q <1> , and the labeled graph Q <k> defines a one-block 
code ipk ■ crf<k> — ► <jf (here, (<Pky)i is the label of the edge j/j). We let £ <k> denote 
the set of (first return) loops of Q <k> , and we let £ <k> denote the labeled loops, 
from Q <k> . 

If Z is a path l\ ■ ■ ■ lj of j edges in a labeled graph, then we denote by I its label, 
which is a word h ■ ■ ■ lj, where U is the label of U. Similarly when we label a path 
If • ■ lj with a word w± ■ ■ ■ Wj, we are defining U = mj. 

Choose Q <ly to be a loop graph for / = / <1> . Define Q <1> by giving each edge 
a distinct label. Given Q <k> , we construct g <k + 1> from Q <k> as follows. (1° will 
denote the empty word, so cl° — c.) 

• Choose a loop l <k> — I of length r k in Q <k> (at least one such loop exists 
by the positivity condition (|5.lip 1. 

• Now Q <k+1 > is the labeled loop graph with exactly the following set of 
labeled loops: for every loop c in C <k> \ {I}, for n = 0, 1, 2, 3, ... , put in 
g<k+i> Qne j QO p £^ c ^ n ^ Q £ i en gth | c | + nr k , with label cl n . 



rj, will be the length of the orbit to be deleted at stage k. 
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The rule £(c,n) i— ► cl n defines a map £< k + 1 > — » (£ <fc> )* which determines a one 
block code ik+i '■ a /<fc+i> — * o~ /<*> which respects the labeling. Therefore the map 
ipk+i is the composition of Lk+i, ■■ ■ ,t>2- Clearly /„ fc+1> = fm h> — 1 if m = rjt, and 
/ J ^' c> = fm h+1> if m < rfc. Each sequence (fn k> )T=i ' IS eventually constant and 
the positivity condition l|5.3[) does guarantee each / > G zZ + [[z]]. Moreover, the 
set of labeled loops in Q <k> of length n or less does not change once > n. We 
define g <co> to be the labeled loop graph whose labeled loops of length n are those 
in all but finitely many of the Q <k> . Then g <co> is a loop graph for f <cc> and 
the labels of Q <ca> define a one block code ip : cr/<oo> — > cr/. 

Injectivity. For every k in N, the map ik+i ■ &f<k+i> — > <7f<k> is injective. 
(The map ifc+i is not surjective; a point w of o~f<k> fails to be in the image of 
of Lk+i if and only if to is forwardly or backwardly asymptotic to the periodic 
orbit ■■■l< k >l< k >l< k > ■■■ .) It follows for 1 < k < oo that tp k : a f <k> -> cr/ 
is injective, with Image (i£>i) D Image(<p2) D Image(y>3) 3 ••■ . For x G cr/ and 
1 < A; < oo, define Vfe(a;) to be the set of integers i such that there exists y G cr /<<=> 
such that ifik(y) = x and the edge begins at the base vertex of Q <k> . (Here 
'Pea = </?•) The injectivity of y>£ and the structure of iu show for any given x that 
V\{x) D V^z) D Vs(x) ■ • ■ . A point x of cr/ will have a preimage under tp in cr/<oo> 
if and only if ni<fe <00 Vfe(a;) is unbounded above and below, in which case 

(5.5) Vooix) = p| V k (x) . 

l<fe<oo 

Because Q <co> inherits from the Q <k>, s the property that distinct loops have 
distinct labels, this implies that ip is injective. 

Zeta function and entropy. For each n, if > n then the number of loops 
of length n or less in Q <k> is the same as in g <00> . At any finite stage k, 
\O n (crt<k+i>)\ = \O n (<jf<k>)\, except for the R n values of k at which = n; 
at each of these values, \O n (o-f<k+i> )| = \O n (at<h>)\ — 1. This description proves 
(J2J and also that \O n (aj<^>)\ = |0 n (cr/)| — R n , for all n. Consequently, using the 
product formula for the zeta function, we also have 



nr=i(i-^) fl " nr=i(i-^) fl " 

which proves ®- By the same product formula, 



(!-*»)*») =cxp^-Q„z":=g(z) 



n 
i 



where Q„ = J2k\n an< ^ therefore lim(Q n ) 1 /™ = lim(i?„) 1 /™ = l/rad(g). Finally, 
given lim(i?„) 1 /™ < A, the claim J3J follows from 

/ \ l/n 

h(a f )= limlog /n = li m log ( |0 n (oy)| - i?„ ) 

= limlog |C„(cr/<oo>)| 1 /" = /l(CT/<oo>) . 

Strong positive recurrence. We have rad(/ <00> ) = min{rad(/), rad(c/)}. If we 
assume lim„(-R„) 1 ' n < A, it follows that rad(g) < 1/A and 

rad(/ <00> ) > ~ <=> rad(/) > I . 
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Therefore <tj<»> is SPR if and only if <jf is SPR, proving (jjjj. 

A magic word. Consider the inductive construction in terms of the sequence 
{§ <k> ), k = 1,2,... . The loops oiQ <k+1> are loops £(c,n) and the label of £(c,n) 
in Q< k + l > is cfr. 

Assume (|5.4ll holds. Let N — r\, so, the first deleted loop Z <:> will have length 
N . Pick a loop W of length N. At this initial stage, in Q <1> every loop is labeled 
by its own name, and symbols of W occur in labels of no loop other than W itself. 
So, for Q = Q <1> , we have the following 

Condition 5.6. A symbol of W can occur in a label I of a loop I in Q only as part 
of an initial segment of I which equals W 

Now suppose that (|5.ti|) holds for Q = Q <k> , and the loop l <k> has the property 
that no symbol of W occurs in l <k> . It then follows from the construction that (|5.(jl) 
holds for Q = g <k+1> . However, because (|5.4ll holds, for each k we can pick for 
l <k> a loop which already exists in Q <1:> and which is not W, and which therefore 
has a label using no symbol of W . 

Inductively, then, we construct the Q <k> to satisfy 1)5. 6)1 . and in the limit obtain 
g<cc> rp ne ma p ^ j g jjjg one _bi oc k; code from af<aa> to 07 defined by the labeling 
of loops in g <00> . We claim that W is a magic word for ip. To see this, first note 
that the labeling of loops in g <00> inherits the following properties from the Q <k> : 

(1) distinct loops have distinct labels 

(2) a symbol of W can occur in a label I of a loop I in Q <ca> only as part of 
an initial segment of I which equals W 

(3) if WDW is a word in aj, then WD is the label of a unique concatenation 
of loops in g <00> . 

Notice that if (3) did not hold, then ip would not be injective, contrary to what we 
established. 

It follows from property (3) and the remark before (|5.5() that whenever a point 
x of 07 has the property that the set {n : x[n,n + \W\ — 1] = W} is unbounded 
above and below, then a preimage of x can be constructed in <7/<°o> . 

Finally, suppose x[i,2|VF| + \D\ — 1] = WDW and tpy — x. It follows from (2) 
that both of the edges y[i) and y[i + \W\ + \D\ — 1] have as initial vertex the base 
vertex of the loop graph Q <00> , and therefore y[i, \ W\ + \D\ — 1] is a concatenation 
of loops in g <ao> . By (3), this concatenation is unique. Therefore, W is a magic 
word for ip, proving JBJ- 

Entropy- conjugacy. We remark that when S and T are SPR, it follows already 
from 1)3.5(1 that the map ip provided by JHJ is an entropy conjugacy. We now proceed 
to the general case, assuming and lim„(i?„) 1 /" < A. 

Let an be the loop shift corresponding to R(z) — Y^=i^nZ n - Appealing to 
(15.41) . we regard cjr as a subsystem of 07. We construct the map ip as above, with 
the following additional constraint: for every k, the deleted loop l <k> is chosen to 
be a loop of or. As in the argument for ©, it follows that any loop W of 07 which 
is not a loop of o~r must be a magic word for ip. 

As ip is injective, it is a measure-preserving isomorphism (<7/<oo> , v) — > (07, tpv) 
for any invariant probability measure v of 07 < 00 >. Let us consider an ergodic 
and invariant probability measure /1 for 07 such that ip^ 1 fails to be a.e. defined 
(otherwise ip : (af<^>, <^ _1 /i) — > would be an isomorphism). If is a loop 
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of 07 which is not a loop of <7r, then fi{x : x[0, \ W\ — 1]} must be zero. Therefore 
li is supported on or. 

It is enough now to prove H{or) < h(af). Assume first that or is not SPR. 
Then h(ajt) = loglim n i?y n < h(of), proving the claim in this case. Assume now 
that or is SPR. Pick some no such that R no < fn an d let R' n = R n for all n ^ uq 
and R' no = R na + 1. Clearly limni?^ 1 ^" = \\m n Rl/ n < h(oR,) < h(oR>). It follows 
that ori is SPR, hence by of Proposition 12.31 h(<jR) < h(oR>) < h(of). This 
implies the claim. 

□ 

6. Main Results 

Lemma 6.1. Suppose op and oq are mixing loop shifts of equal entropy log(A) > 0, 
and 1 < [3 < A. Then for any sufficiently large N € N, there are loop shifts a t and 
ig such that 

(1) There are injective one-block codes o~f — » o~p and a g — > erg , each of which 
is an entropy- conjugacy with a magic word. 

(2) \O n (o f )\ = \O n (a g )\ =0forn<N , 

(3) \O n {a f )\ - \O n (a g )\ = \O n (a F )\ - \O n {a G )\ , n > N, and 

(4) min{/„, 5 „}>/3" , n > N . 

Proof: Because (3 < X and 

lim \O n (a F )\^ n = A = lim P^g)^ 11 

71 n 

for TV sufEcicntly large we may assume 

n>N mm{\O n (a F )\,\O n {a G )\} >2\f3 n ] , 

n 

where [/?"] denotes the integer ceiling of f3 n . Fix such an N. Beginning with f = F, 
we apply Lemma |5.1l repeatedly with k = z n , where n is the length of a shortest 
loop, until all loops of length less than N have been deleted. This produces a series 

00 

F = Fm 11 

such that a-p — > of is an injective one-block code which has a magic word and is 
an entropy-conjugacy (indeed, h(o z ™) — 0). We also have: 

l-F = (l-F)/( Yl (l- z n )\°n^\\ 

l<n<N 

=u-f) n (f,*"T'""- 

l<n<N k=0 

Now \O n {o T )\ = \O n {o F )\ > 2\P n ] for n > N, and F n = \O n (o T )\ for N < n < 
2N. Let b n — [/?"] and define the series (polynomial) 

2AT-1 

b(z) = J2 6 « z " 

n=N 

(so, F n - b n > f3 n for N < n < 2N). Set h = F - b and define / = Kb* = 
h(l+b+b 2 +- ■ ■). By Lemma l^Tl there is an injective one-block code Of — ► o-p which 
has a magic word and is an entropy-conjugacy. Indeed, h(ob) < h{oF) — h(oj) as 
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b defines a finite state Markov shift whose entropy must increase when adding any 
of the remaining loops of of. Also, the choice of b implies f n > j3 n for n > N. 

Similarly construct G from G, and for b(z) exactly as above set k = G — b and 
g = kb* . We have proven © and To prove J2J), we compute 

\O n (o f )\ - \O n [o a )\ = {\O n (o T )\ - \O n (o b )\) - (\O n (o^)\ - \O n (o b )\) 

= \O n {o T )\-\O n {o^)\ 

= \O n (o F )\ - \O n (o G )\ , n>N. 

□ 

Theorem 6.2. Suppose of and oq are loop shifts of equal period and equal entropy 
log(A) (possibly infinite), and 

W\O n {o F )\-\O n {o G )\\ 1,n < \ . 

n 

Then of and oc are almost isomorphic, by an almost isomorphism which induces 
an entropy- conjugacy of op and o G . 

Proof: Without loss of generality suppose A > 1. To begin, suppose the period 
is one, i.e. op and o G are mixing. Let 7 = lim | \O n (op)\ — lOn^oa)]] 1 ^ ■ Choose 
(3 such that 7 < (3 < A. Choose /, g and N to satisfy the statement of Lemma lb. II 
for these [3, A, F and G, with N large enough that 

(6.3) n> N => \\O n (o F )\ - \O n (o G )\\ < [3 n - 1 . 

It suffices now to prove the theorem with o / and o g in place of of and oq ■ 

We will apply the Loops Lemma 15.21 to 07 and to o g , using sequences (R n ) 
defined for / and for g as follows: 

R n (f) = max{0, \O n {o f )\ - \O n {o g )\} , 
R n (g) = max{0, \O n (o g )\ - \O n (o f )\} . 

These sequences (R n ) determine the corresponding sequences (r^) of the Loops 
Lemma, as well as Loops Lemma sequences of functions f^ and g^ n \ the corre- 
sponding graphs g( n )(f) and G^ig), and the limit functions f <oc> and g <oa> . 
Recall /„ and g^ denote the number of (first return) loops of length n in Q^ n \f) 
and G^(g), respectively. Using J5J of the Loops Lemma, it is easy to verify by 
induction that for every n, 

• for k < n, the number of loops of length k in Q^(f) and Q^ n \g) is the 
same, and 

• fi n) -g { n n) = \O n (o f )\-\O n (o g )\ . 

It follows that f <oa> — g <oa> . Consequently, there is a map Of<^> — > o g <^> 
which is a one-block code renaming edges, with inverse likewise a one-block code 
(here all nonempty words are magic words). By symmetry, it now remains to show 
that the injective one-block code Of«x>> — > Of of the Loops Lemma can be chosen 
to have a magic word and to be an entropy-conjugacy. 

Applying the definition of R n (f ), Lemma f6.1l|3*|) . the bound l|6.3|) . and Lemma 
I6.1ljl|) . we have for n > N that 

R n (f) = max{0, \O n {o f )\ - \O n (o g )\} = max{0, \O n (o F )\ - \O n (o G )\} 

< P n ~ 1 < /„ - 1 ■ 
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Moreover, by Lemma lfTTlffil . the length n of the first loop of 07 deleted in the Loops 
Lemma construction is equal to N. Therefore conditions J§} and JJJ) of the Loops 
Lemma are satisfied, and o~t<oa> — > o~t can be chosen to be an entropy-conjugacy 
with a magic word. By symmetry, we obtain an entropy-conjugacy with a magic 
word a j — > a g . This finishes the proof in the mixing case. 

Now suppose that up and tq have period p > 1 . Write the F, G in the form 

oo oo 

F(z) = J2 am pn , G(z) = b nz m ■ 

n—1 n—1 

Define a = ^2 a n z n and b = b n z n . The loop shifts cr a and o~b are mixing and by 
the previous argument there is a loop shift er c , c = ^2 c n z n , with injective one-block 
codes <7 C — * er a and a c — > ct;, with magic words. Let ff(z) = ^c„z p ™. It follows 
easily that there are injective one-block codes an —> o~f and <Jh — * 0\f with magic 
words. □ 



Theorem 6.4. Suppose Sa and Sb are Markov shifts, and Sa is SPR with finite 
entropy. Then the following are equivalent: 

(1) Sb is SPR, Ii(Sa) = It-(Sb) and period^^) = period(S , s). 

(2) Sa and Sb are almost isomorphic. 

(3) Sb is SPR, Sa and Sb are entropy-conjugate. 

Moreover, under these conditions there exists an entropy-conjugacy which is induced 
by an almost isomorphism. 

Proof: J3J) => 0. The almost isomorphism induces a bijection respecting Uni- 
tary isomorphism between the ergodic measures of Sa and Sb with full support. 
Therefore Sb has a measure of maximal entropy [1b, and there is a Unitary iso- 
morphism (Sa^a) — y {Sbt^b)- Because exponential recurrence is an invariant 
of Unitary isomorphism, it follows from Proposition 12.31 that Sb is SPR. The en- 
tropy and period of Sa and Sb are already invariants under measure-preserving 
isomorphism of (Sa,^a) and {Sb,^b)- 
This last remark also proves (0 ==> 0- 

=> (J2J and JSJ). Pick a vertex v in the graph with adjacency matrix A which 
is used to define the edge shift Sa- Because Sa has finite entropy, for each n the 
number F n of first return loops to v is finite. Let F = J2n°=i^nZ n . The natural 
injection -» Sa is a one-block code with a magic word. Similarly choose oq for 
Sb- It suffices to prove the theorem with o~p and o~q in place of Sa and Sb- 

Let log(A) denote the entropy of of and <jq. Because Sa and Sb are SPR, it 
follows from Corollary 14 . 41 that 

lim"||0 n (cxF)| - |0 n (<r G )|| 1/n < A . 

Now Theorem 16.21 shows that o~f and oq are almost isomorphic, by an almost 
isomorphism which induces an entropy-conjugacy. □ 

Theorem 16.41 bees the following questions. 

Questions 6.5. Is SPR an entropy-conjugacy invariant? On the other hand, must 
positive recurrent Markov shifts of equal entropy and period be entropy-conjugate? 
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7. Application to other dynamical systems 

Given a measurable system T (as denned in (|3.2|l ). we begin by recalling the defi- 
nition of the natural extension of T (to an invertible system) . Let X denote the sub- 
set of X^ consisting of the bisequences (. . . Xq, X\, . . .) such that T{xi) = x i+ i 
for all i. Associated to X is the cr-algebra generated by requiring every coordinate 
projection to be measurable. Let T be the bijection X — > X defined by applying 
T coordinatewise. The projection of X onto the first coordinate defines a map 
which intertwines T and T (if T is invertible, then this coordinate projection is 
a bijection). This coordinate projection induces an entropy-respecting bijection 
A4(T) — > M(T) of invariant Borel probabilities. Often measurable phenomena are 
more conveniently studied on invertible systems, and the natural extension is used 
for this. 

Now supose 7 is an entropy conjugacy of invertible systems. We say 7 refines 
the hyperbolic structure if the entropy- negligible set of Definition 13 . 31 can be chosen 
such that for x, y in its complement, 

lim dist(7(T"a;),7(r™2/)) = => lim dist(T n x,T n y) = , and 
lim dist(7(T"x),7(T™2/)) = lim dist(T n x,T n y) = . 

n — 00 n — > — oo 

We say 7 preserves the hyperbolic structure if both 7 and its inverse refine it. 

Our next theorem is essentially an assembly of results from El El EI! > collected 
for the statement of Theorem l7.2l To prepare, recall some definitions. A piecewise 
monotonic interval map is a map / : [0, 1] — > [0, 1] such that there is a finite 
partition of [0, 1] into intervals on each of which the restriction of / is continuous 
and strictly monotonic. The map / is said to be completely discontinuous if the 
previously mentioned partition can be chosen so that all its endpoints except and 
1 are discontinuities of /. 

A multi- dimensional j3 -transformation [S] is a map defined byx^ B.x mod Z d 
for x G [0, l] d , d = 1, 2, . . . and B an expanding affine map of M. d . 

We will not define the class of subshifts of quasi finite type j^j; this is a useful 
class of finite state subshifts which contains the class of subshifts of finite type, but 
is much larger (for example, every finite positive entropy is achieved by a subshift 
of quasi finite type). 

Theorem 7.1. The following measurable dynamical systems have natural exten- 
sions which are entropy-conjugate to the disjoint union of finitely many SPR Markov 
shifts of equal entropy. 

(1) Subshifts of quasi finite type 

(2) Piecewise monotonic interval maps with non-zero topological entropy. 

(3) The multi- dimensional (3 -transformations 6 . 

(4) C°° smooth entropy- expanding maps including, e.g., smooth interval maps 
with non-zero topological entropy 5^ or (x, y) 1— ► (a — x 2 + ey, b — y 2 + ex) 
with a, 6 < 2 close to 2 and \e\ sufficiently small [7|E1- 

Moreover, the entropy- conjugacy from the natural extension to the SPR Markov 
shift can be chosen to refine the hyperbolic structure. In cases (1) and (3) and for 
completely discontinuous piecewise monotonic maps, the entropy- conjugacy can be 
chosen to preserve the hyperbolic structure 2 



! In cases (2) and (4), it is the hyperbolic structure of the symbolic dynamics which is preserved. 
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Proof sketch. (1) follows from Theorem 3]. 

(2) follows from Lemma 2]. 

(3) also follows from [S] Lemma 2]. 

(4) An argument similar to that in |5J Lemma 2] can be applied to G°° smooth 
entropy-expanding maps in arbitrary dimension (see [5]). More specifically, in the 
proof in section III of [B], the condition that /*(£*) is zero can be replaced by the 
condition that it is very small. Then we can conclude that the Markov shift is SPR 
as in [5] , Corollary at the end of section "Entropy at infinity" . 

The proof of the preservation of the hyperbolic structure is done in two steps. 
To begin, we recall that multidimensional /^-transformations and completely discon- 
tinuous piecewise monotonic maps have natural extensions that are topologically 
conjugate to those of their symbolic dynamics. These are subshifts of quasi-finite 
type (or Q.F.T.) as shown in So we can restrict to Q.F.T. 

We use terminology and facts from [5]. A Markov shift (the so-called complete 
Markov diagram) is built over the Q.F.T. Let 7 be the entropy-conjugacy of the 
original subshift into that Markov shift. 7 obviously refines the hyperbolic structure 
as the canonical partition of the Markov shift is finer than the original canonical 
partition, using the identification 7. 

On the other hand, for x in the Markov shift, xq is determined by the sequence 
( r y~ 1 x) n , n < 0. Thus the unstable relation is preserved by 7. 

One cannot replace n < by n > here. But we have the following property for 
a.e. point x in the Markov shift: x n is determined by the finite word ("fx)k, n — £ < 
k < n with £ — £(a n x). By stationarity £{a n x) /> 00 for a.e. x. Hence the 
future in the subshift determines x no for some no. But x no and the sequence 
( , y~ 1 x)k, k > hq determine Xk,k > no- Thus the stable relation is also preserved 
by 7. This concludes the proof for the case of Q.F.T. □ 

Combining Theorem 17 . II with our main result, Theorem 16. 41 we obtain 

Theorem 7.2. A system in any of the classes listed in Theorem \ 7.1\ has finite 
entropy and only finitely many ergodic measures of maximal entropy. Two systems 
from the list have natural extensions which are entropy conjugate if and only if they 
have equal entropy and for each p £ N the same number of ergodic maximal entropy 
measures of period p. 

Corollary 7.3. Two topologically mixing piecewise monotonic interval maps are 
entropy conjugate if and only if they have the same entropy. 

Proof. A topologically mixing piecewise monotonic map has a unique measure of 
maximal entropy, and this measure is mixing |26| . So, in this case the invariant 
above simplifies to the entropy of that measure, which is the topological entropy 
by the variational principle [HH]. □ 

8. Remarks 

We continue our convention that "Markov shift" means "irreducible Markov 
shift" , even in the finite state case. 

Remark 8.1 (Magic words). Magic words come from the finite state coding theory 
of Markov shifts |31II35| . They play a basic role in the structure theory of finite-to- 
one codes between finite-state Markov shifts (where they are defined more generally 
for maps which are finite-to-one but not necessarily one-to-one almost everywhere) . 
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In the past, a major step in some code constructions was to show choices could 
be made to guarantee existence of a magic word, so that the resulting code would 
be one-to-one almost everywhere. (For example, the construction of a magic word 
is the essential difference between PP and Ashley's Replacement Theorem 

|31M35| has largely eliminated this step for codes of finite state systems. 

Remark 8.2 (Almost conjugacy). |T1 1311 135] Two finite state Markov shifts S, T are 
almost conjugate |35| , equivalently almost topologically conjugate ^ I31| , if there is 
another Markov shift R and surjective block codes R — + S, R — > T each of which 
are one-to-one almost everywhere. The Adler-Marcus Theorem £Q is that entropy 
and period are complete invariants of almost topological conjugacy for finite state 
Markov shifts. (Here each of the maps R — > S , R — > T must have a magic word, and 
the content of "a.e." contains no more information than we have in Propositions 
OandEl) 

The classification of countable state Markov shifts up to almost conjugacy is 
extremely difficult and remains an open problem |12j . Doris Fiebig has the most 
general result: for Markov shifts S,T of equal finite positive entropy, there is a 
Markov shift R of equal entropy and countable-to-one biclosing continuous surjec- 
tions R — > S, R — > T. Here "biclosing" is a bonus and it is impossible to replace 
"countable-to-one" with "uniformly finite-to-one" (T2|. Note, this result places no 
restriction on the relative classes of S and T (which can be transient, null recurrent, 
positive recurrent or strong positive recurrent, each independent of the other). 

Remark 8.3 (Almost isomorphism). Our definition of "almost isomorphism" is anal- 
ogous to the definition of "almost conjugacy"; for countable state Markov shifts, 
it seems more tractable and perhaps better suited to studying invariant measures. 
Injective block codes between finite state Markov shifts of equal entropy must be 
homeomorphisms, so "almost isomorphism" does not appear as a meaningful dis- 
tinct relation in the finite state category. 

Remark 8.4 (Other Markov measures). The Adler-Marcus Theorem gave good fini- 
tary codes between finite state Markov shifts of equal entropy and period. By a 
Markov chain, we will mean a Markov shift together with an invariant Markov mea- 
sure. The subsequent search for good finitary codes between finite state Markov 
chains has a sizeable history (see [H3 EH1 E2 ESI E3 1^ ES] and their references); 
for years, there was an attempt to produce general codes by way of an almost 
topological conjugacy. Marcus and Tuncel destroyed these hopes with their weight - 
per-symbol polytope invariants The paper [21] of Mouat and Tuncel effectively 
exploits one alternate approach, using ordered algebra. The "loops method" of this 
paper is another approach; we hope it will be of use for Markov chains beyond 
the case of the measure of maximal entropy. Both the Mouat-Tuncel construc- 
tion and the Loops Lemma method yield magic word isomorphisms when they 
work. We expect that the known invariants (ratio group |33| . weights group with 
distinguished coset beta function [5JI|) of hyperbolic-structure-preserving iso- 
morphism |54l 155) with finite-expected coding time will be the only obstructions 
to the existence of a magic word isomorphism between finite state mixing Markov 
chains. 

Remark 8.5 (Local compactness and loop shifts). Where we use "irreducible Markov 
shift" , Gurevich and Savchenko [21] use "indecomposable symbolic Markov chain" . 
Their use of "symbolic" rather than "topological" was to emphasize that topological 
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issues are not always paramount. The following simple construction illustrates this 
point in our context. 

Let 07 be any loop shift such that /(l) = oo (i.e., / is not just a polynomial). 
Then the space aj will be neither locally compact nor er-compact. The loop shift 
is still closely related to a locally compact system. The graph in Figure 1 describes 
the construction of a locally compact Sa from /, such that the natural injective 
one block code c/ — > Sa has many magic words and induces a bijection of spaces 
of invariant Borel probabilities, Ai(af) — > A4(Sa)- 




Figure 1. 
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